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PREFACE 


Documented In this report are the results of a study to evaluate the Large 
Area Crop Inventory Experiment clustering and classification procedures In 
terms of variance of the proportion estimates and the probabilities of cor- 
rect classification for three categories. The categories of interest were 
corn, soybeans, and other. 

Timely preparation of the duta and experiment design for this study would 
not have been possible without the aid of several coworkers. K. Lennington 
and D. Register wrote the initial experiment design. R. Abotteen and 
0. Johnson helped to verify the ground- truth labels and to prepare the Initial 
machine processing runs. Their assistance with this study was greatly appre- 
ciated. 
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ACRONYMS 


ANOVA 

analysis of variance 

LAC IE 

Large Area Crop Inventory Experiment 

Landsat 

land satellite 

ME 

machine estimate 

PI 

Procedure 1 

PCC 

probability of correct classification 

PCCl 

probability of correct classification for type 1 dots 

PCC2 

probability of correct classification for type 2 dots 

pixels 

picture elements 

R 

reduction coefficient 

SAE 

stratified areal estimate 

TY 

Transition year 



1 . INTRODUCTION 


In transition from the Large Area Crop Inventory Experiment (lACIE) to the 
LACIE Transition Project (FY79), the basic components of Procedure 1 (PI) 
required investigation. PI, as used in LACIE. was a two-category proce- 
dure estimating wheat and nonwheat. In thn mixed wheat states, a three- 
category classification was used to estimatu winter wheat, spring wheat, and 
other in LACIE Phase III, but no investigation of the appropriate number of 
type 1 dots was made. This study was initiated to test a three-category 
classifier using corn, soybeans, and other as categories to examine the 
appropriate number of type 1 dots. Since a machine estimate (ME) and a 
stratified areal estimate (SAE) were produced by both nearest-neighbor clus- 
tering and maximum-likelihood classification in a standard PI run, all four 
estimates were compared to ground-truth proportions. Each of these four 
proportion estimates were also analyzed in terms of the variance of the 
estimates and the variance of the 'orresponding probabilities of correct 
classification (PCC). The reduction coefficients, R-values, were calculated 
for all processing runs and were compared to previous two-category calculations. 



2 , EXPERIMENT OB«)ECTIVES AND DESCRIPTION 


2.1 OBJECTIVES 

The experiment was designed to test the three-category classifier using corn, 
soybeans, and other as the categories. The objectives of the experiment 
were; 

a. To examine a three-category classifier proportion estimate In terms of 
the number of type 1 dots used 

b. To examine a three-category classifier In terms of the varlacsce of the 
estimate 

c. To examine the evaluation criterion (the PCC) In terms of Its variance 

2.2 GENERAL DESCRIPTION 

The experiment was planned to Include the processing of 12 test segments 
using varying numbers of starting dots (type 1) and 105 bias correction dots 
(type 2). Of these 12 segments, 6 were obtained from the corn and soybean 
allocation and 6 from the LACIE Transition Year (TY) allocation. Detailed 
Information about these data Is described In table 2-1. 

The crops chosen for both the six segments obtained from the corn and soy- 
bean allocation and the six segments obtained from the LACIE TY allocation 
were the major crops in the segment. The primary purpose of the test was to 
process corn and soybeans; however. If one of these crops were not adequately 
represented In a segment, another major crop was chosen to replace it as a 
crop of interest. 

The number of type 1 dots (sets of 30, 45, and 60 dots) was varied in order 
to examine the effect of the number of dots used in a three -category classifier. 
To estimate the variance of the proportion estimates and the PCC, three inde- 
pendent sets of dots were selected from the 209 grid intersections for a fixed 
number of type 1 dots. Thus, a total of 108 processing runs was possible. To 
make the initial type 1 dot select10f;s for a segment, three independent sets of 
60 dots were randomly selected in the usual manner of skipping all border 



TABLE 2-1 DATA SET 


Segment 

Acquisition used 

Major crops 

Number 

location 

146 

Kentucky 

8180, 8198. 8234, 8270 

Corn (C), Soybeans (X) 

185 

Minnesota 

8169, 8197, 8205, 8224 

Corn (C), Soybeans (X), 
Spring wheat, Sunflowers 

804 

Iowa 

8229, 8247, 8274, 8292 

Corn (C), Soybeans (X) 
Oats 

812 

Mississippi 

8199. 8235, 8280, 8289 

Soybeans (X), Cotton (K), 
Rice 

824 

Illinois 

8163, 8235, 8271, 8307 

Corn (C), Soybeans (X) 

883 

Iowa 

8186, 8213, 8222, 8293 

Corn (C), Soybeans (X) 

1075 

Nebraska 

8133, 8206. 8259, 8296 

Corn (C), Alfalfa (A) 

1253 

Oklahoma 

8165, 8184, 8274, 8291 

Soybeans (X), Alfalfa (A) 

1341 

Kansas 

8113, 8167, 8186, 8293 

Corn (C), Soybeans (X) 
Sorghum 

1502 

Colorado 

8138, 8246, 8282, 8300 

j Corn (C), Sugar beets (Y) 
Winter wheat. Alfalfa 

1572 

Nebraska 

8153, 8206, 8279, 8296 

1 Corn (C), Pasture (P) 

1591 

Nebraska 

8134, 8241, 8259, 8278 

Li ,»n ... - . ■■ 

Corn (C), Sorghum (E) 












or edge picture elements (pixels) described In reference 1. Next* IS dots 
were randomly deleted to produce three sets of 45 type 1 dots. And again. 

15 dots were randomly deleted for three sets of 30 type 1 dots. For both of 
these random deletions, each category was guaranteed to have at least one 
type 1 dot, thus restricting the deletion process. 

Each processing consisted of a three-category version of the standard PI 
clustering and classification. Proportion and PCC estimates were obtained 
from the automatically labeled clusters and from the maximum-likelihood 
classifier output. The SAE were also calculated for each ME using a set of 
105 type 2 dots. For the three replications, using a fixed number of starting 
dots (30. 45, or 60), the type 2 dots were selected Independently where the 
overlap (between sets of type 2 dots only) occurred from necessity. The 
abundance of border and edge pixels In the type 1 selections prohibited a 
third set of dots for three of the twelve test segments. This caused the 
total number of processing runs to be decreased to 99. 

2.3 EXPERIMENT DESIGN 

An analysis of variance (ANOVA) was planned for each set of starting dots 
(set of 60, set of 45, and set of 30) to determine differences between the 
proportion estimation procedures. The signed difference between each pro- 
portion estimate and the ground-truth estimate was used for the response 
variable. The linear model for the three analyses was as follows: 

''ijkm * “I * * 'k + 

where 

u = the overall mean of the observations 
0 ^ = the segment effect (i =1,2, 12) 

Tj = the treatment or procedure effect (j = 1, 2, 3, 4) 

C(^ = the crop effect (k = 1, 2) 




^ijkni * •’if'doni error for each observation (m represents the repetitions 
performed for each observation and Is a function of 1. J» and k.) 

'^Ijkm * »*esponse variable 

An ANOVA was also planned for each proportion estimation procedure to deter- 
mine differences between the number of starting dots used (60i 45i and 30). 
The response variable was again the signed difference between the proportion 
estimate and ground truth. The linear model for these four ANOVAs was as 
follows; 


^Ijkm • M + 0^ + Tj + C,^ + 


where 

Tj « the treatment effect representing the number of starting dots used 
is = 1. 2. 3) 


M 


c 


°f 

Ck 

ijkm 


X the overall mean of the observations 
= the segment effect (1 » 1 , 2, 12) 

= the crop effect (k = 1* 2) 

= the random error for each observation (m represents the repetitions 
performed for each observation and as a function of 1 , j , or k. ) 


A general linear model ANOVA program was used to generate the ANOVA tables 
{ref. 2). 


To examine the variability in the performance of PI, estimates of the variance 
of the proportion estimates and the variance of the PCC estimates were to be 
computed. The variances were estimated by pooling the within-segment vari- 
ances over each segment for each case of 30, 45, and 60 type 1 dots. These 
variances were then pooled over all segments for each case of 30, 45, and 60 
type 1 dots. The equations for computing these variance estimates are as 
follows. 




Var [X] 




2 


where 

X^j * the variable* proportion estimate* or PCC as measured for the 
sampling and Jth segment 

Xj ■ the average value of X|j for the Jth segment 
N ■ total number of segments and range of J 

Mj >* total number of samplings which are dependent upon S and are In the 
range of 1 

Separate comparisons were planned for the variance of the proportion estimates 
and the variance of the PCC estimates. In each case* ratios between the 
variances for the es'^^'.’ats? of the set of 45 type 1 dots and the set of 60 
type 1 dots and bet.weeA' the variances for the estimates for the set of 30 and 
the set of 60 type 1 dots were to be calculated. These ratios were approxi- 
mately distributed as F-statlstIcs and, therefore* may be tested for statis- 
tically significant departures from unity. Statistical tables Indicated that 
ratios with a value of approximately two or larger were significant at the 
5-percent level If a total of 12 segments was used. 



3. PROCEDURAL DESCRIPTION 


This study was performed using ground-truth labels that were manually veri- 
fied with an annotated aerial photograph and registered grid overlay. The 
grid overlay corresponds to the grid Intersections on the land satellite 
(Landsat) film products. Border (spcrr^ally mixed pixels) and edge (spatially 
misregistered pixels from acquisition to acquisition) were also Identified and 
documented at this time since these types of pixels are not used as type 1 
dots. 


Standard PI processing was performed. The type 1 dots started the nearest- 
neighbor clustering algorithm (ref. 3) with the following parameters. 


a. 

CO 

« 

60 

b. 

Percent 

0 

0 

c. 

SEP 

a 

1 

d. 

STDMAX 

a 

20 

6 « 

OLMIN 

a 

0 

f. 

R2 


8191 

9* 

NMIN2 

= 

18 

h. 

ITMAX 

= 

0 

1 

P of N 

= 

1 

j. 

SC Seq. 

= 

S 

k. 

Distance measure 

a 

L2 


The NMIN2 parameter was changed from the standard value of 100 to 18 In 
order to prevent the delet1;»n of small clusters. 


The clusters were automatically labeled by the closest type 1 dot using an 
L2 distance criterion. The cluster statistics were then used In a maximum- 
llkellhood classifier to classify the segment. Output reports Included 




cluster proportion estimates* classification proportion estimates, their 
corresponding SAEs, type 1 PCCs, and type 2 PCCs. 

Initially, the three-catogory version of PI was run using each set of 60 
type 1 dots for each segment. Following the completion of these runs, 15 
type 1 dots were deleted at random from each set of type 1 dots and the 
processing was repeated using the 45 ri(i!a1n1ng dots. Finally, 15 more type 1 
dots were deleted at random, and the 30 remaining dots were used to make the 
final runs. 




4. RESUtTS 


The estimates obtained from the study are shown In tables 4-1a, 4<-1b» and 
4-1 c for the sets of 60, 45, and 30 type 1 dots, respectively. The Cl and 
C2 are the two categories of Interest that were processed with other (N). 

The ground- truth estimates of these categories are 400-random-dot counts, 
taken from annotated aircraft photography because digitized ground-truth maps 
were not available, The ME, SAE, type 1 dot PCC (PCCl), and type 2 dot PCC 
(PCC2) are shown for both cluster and classification results. The SAEs were 
computed on a category level for both MEs. Note that the PCC values are 
computed for the ME only. 

The raw proportion estimates were differenced with the ground truth before 
a.ialysls, and these signed differences appear In the appendix, table A-la, 
A-lb, and A-lc for 60, 45, and 30 type 1 v' respectively. 

The first set of ANOVA tests was performed on the signed differences between 
proportions and ground truth for each different number of type 1 dots: 60, 

45, »ind 30. This was to determine 1f any significant differences existed 
between the four methods of achieving a proportion estimate. These ANOVA 
tests appear 1n table 4-2a, 4-2b, and 4-2c, For each separate set of starting 
dots, no significant differences were found between the proportion estimates. 

The ANOVA tests were also performed to detect differences between 30, 45, 
and 60 starting dots, based on the signed differences between the proportions 
and ground truth. These ANOVA tests appear in tables 4-3a, 4-3b, 4-3c, and 
4-3d. No significant differences were found between numbers of starting dots 
(60, 45, and 30) for each proportion estimation technique: machine cluster- 
ing, SAE clustering, machine classification, and SAE classification. 

The variance of the proportion estimates on a per segment basis appear In 
the appendix, tables A-2a, A-2b, and A-2c for 60, 45, and 30 starting dots, 
respectively. Comparisons were more readily made when these variances were 
pooled over all the test segments for each number of starting dots, as In 












































































TABLE 4-1 Continued 


(b) 45 type 1 dots 



Sequent 

GrouAd-trutb data 



Cluster 





Classify 



Hechfne 

estlnote 

Stratified areal 
estinate 

PCCl 

PCC2 

Rachine 

estimate 

Stratified areal 
cstinate 

PCCl 

PCC 2 

Pfcaber 

Locatfon 

Crop 1 

Percent 

Crop 2 

Percertt 

Crap 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

1075 

Jlebfoska 

C 

29 

n 

8 

17 

4 

28 

14 

100.0 

72,4 

17 

4 

27 

14 

1X.0 

70.9 







25 

7 

33 

12 

100 

73.1 

24 

7 

34 

12 

IX 

76.7 





H 


26 

6 

33 

10 

100 

78.1 


6 

33 

10 

IX 

79.8 

1341 

Kansas 

n 

40 

n 

14 

47 

17 

35 

17 

97.8 

53.3 

47 

17 

36 

17 

97.8 

55.9 







37 

21 

39 

15 

100 

53.3 

37 

20 

39 

16 

IX 

58*1 



H 




44 

23 

33 

15 

100 

X.5 

44 

22 

39 

16 

IX 

55.2 

1591 

Hebresfca 


15 

£ 

7 

17 

4 

17 


33.3 

79.4 

17 

4 

17 

3 

88.9 

8a4 







ID 

1 

17 


100 

79-0 

9 

] 

18 

5 

IX 

81 *D 



H 




14 

8 

16 

■I 

97.7 

78,8 

T4 

7 

17 

6 

IX 

X.4 

146 

Kentuciqr 

H 

17 

X 

46 

12 

SO 

T« 

B9 

97,8 

B1.0 

12 

X 

19 

45 

97-8 

81*9 






17 

45 

18 

Kfl 

100 

82.0 

16 

45 

17 

43 

IX 

83*1 

185 

Hlwiesoti 

c 

B 

X 

8 

4 

7 

5 

e 

95.5 

B4.8 

4 

7 

5 

6 

97.7 

M.2 







3 

2 

5 

n 

IX 

85.7 

3 

2 

5 

11 

IX 

85.4 







1 

a 

e 

7 

IX 

X.O 

1 

8 

8 

7 

IX 

81*6 

BOI 

Zcna 

c 

46 

X 

29 

44 

26 

40 

34 

IX 

79.0 

a 

26 

40 

34 

IX 

78.8 







45 

27 

42 

30 

IX 

81.0 

45 

27 

42 

29 

IX 

X.8 







45 

28 

41 

28 

IX 

81.9 

44 

28 

41 

28 

57.8 

81.7 

812 

IHssIssIppI 

X 

43 

K 

7 

43 

7 

50 

5 

95.6 

X.O 

48 

8 

49 

6 

95.5 

ei-7 






48 

3 

49 

4 

IX 

B3.B 

48 

3 

S3 

5 

IX 

X.8 







50 

6 

43 

7 

IX 

78.1 

49 

6 

43 

7 

97*7 

78.8 

824 

nifiiDis 

c 

52 

X 

41 

54 

42 

54 

34 

97.8 

77.5 

54 

42 

53 

35 

97*8 

78.8 







51 

43 

47 

44 

IX 

81.6 

51 

43 

49 

43 

IX 

82*3 

883 

lout 

c 

37 

X 

33 

35 

31 

41 

34 

97.8 

66.7 

35 

32 

42 

32 

95*6 

66.3 







38 

30 

38 

33 

IX 

70.9 

38 

30 

X 

33 

97*8 

72.9 

1253 

OfcUlioBe 

X 

33 

A 

3 

35 

1 

29 

2 

97*8 

86.8 

35 

1 

32 

a 

IX 

X*S 







27 

1 

33 

3 

95.6 

61.9 

27 

1 

33 

3 

95.6 

82*5 







33 

1 

2B 

2 

IX 

64.3 

33 

1 

2B 

1 

TX 

86.1 

1f02 

Colorado 

c 

11 

T 

4 

7 


6 

7 

IX 

82,9 

7 

2 

6 

6 

IX 

82*4 







3 

i 

12 

5 

IX 

87.6 

3 

1 

12 

5 

IX 

87.4 







n 

7 

B 

8 

IX 

85.7 

10 

7 

9 

7 

IX 

85.6 

1572 

Rebmsia 

c 

14 

P 

64 

6 

81 

14 

SB 

95.6 

67.6 

6 

ST 

■9 

59 

95.6 

68.6 







14 

75 

14 

54 

95.6 

70.5 

14 

75 


56 

93.3 

72.B 







5 

72 

14 

6 

93.3 

69.5 

6 

72 

HU 

lJI 

93.2 

7D.5 


<leftn1t1ons: 
Crop codes 
A « KlfalfA 
C * com 


K • cotton 
P ■ posture 
X • soytetns 
T « &ti 9 tr beets 













































TABLE 4-1 Concl uded 


(c) 30 type 1 dots 


1 

Ground-truth data 

Cluster 

Classify 

Machine 

estiwce 

Strat1fi«a areal 
estimte 

B 

PCC2 

Machine 

C5tir:ate 

Ctra tinted areal 
esttrsate 

PCC1 

1 

PCC2 

nmtter 

Location 

Crop 1 

Percent 

Crop 2 

Percent 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Cmp 1 


Crcp 1 

esB 

107$ 

Mcbrasfea 

c 

29 


a 

17 

2 

29 

13 

100.0 

67.6 

T7 

1 

25 

12 

TX.O 

68.2 







26 

10 

33 

ID 

IX 

74.0 

26 

:o 

33 

72 

IX 

76.7 







£3 

5 

mm 

£ 

IX 

74.1 

23 

e 

■■ 

3 

TX 

J9.£ 

1341 

Unus 

C 

4D 


u 


le 

^9 

le 

w.s 

56.2 

52 



17 


$6.3 







31 

29 


14 

95.7 

52.4 

3T 


44 

14 

IX 

£6.2 







46 

31 

mm 

IS 

IK 

45.7 

46 

IT 

37 

16 

96.6 

47,6 

t»l 

Mdiraslcj 

c 

1$ 

E 

7 

35 

3 

IS 

3 

56,7 

K99 

tz 

2 

77 

mm 

55.7 

e:.D 







10 

1 

17 

z 

IK 

■ iU 

TO 

: 

16 


7X 

75.6 







10 

6 

16 

■■ 

100 

■19 

n 

4 

16 

H 

3X 

75.7 

146 

Kcfktuctcy 

c 

17 

X 

46 

14 

S3 

20 


BOX 

71-4 

14 

54 

27 


36-7 

£5.9 







16 

45 

17 

■9 

100 

75.3 

16 

45 

18 

mm 

IX 

83-1 

U5 

Minnesota 

c 

8 

X 

£ 

0 

5 

5 

6 

96-6 

33.3 

0 

5 

c 

5 

56.6 

SS.I 







2 

3 

3 

9 

100 

33-8 

2 

*3 

3 

10 

IX 

34. C 







6 

3 

a 

6 

96.7 

76.2 

5 

1 

3 

6 

£5.7 

7£.2 

sot 


c 

46 

X 

23 

44 

2£ 

41 

35 

100 

76.2 

4$ 

27 

45 

34 

IX 

77.7 







49 

27 

42 

29 

m 

79.0 

43 

27 

43 

29 

IX 

79-6 







43 

26 

42 

23 

10G 

3T.D 

47 

27 

41 

22 

IX 

81.6 

aiz 

Mississippi 

t 


K 

7 

43 

5 

43 

6 

100 

73.1 

42 

5 

49 

£ 

TX 

ei.7 







60 

3 

47 

6 

IK 

£2.9 

X 

4 

52 

7 

IX 

8G.4 







£5 

7 

43 

Z 

53.3 

78.1 

54 

7 

42 

6 

9G.e 

75.8 

SZ4 

nilnofs 

c 


X 

41 

£7 

33 

SO 

37 

IX 

77.5 

57 

39 

49 

33 

TOO 

75.3 







ST 


47 

42 

m 

75.5 

X 


49 

41 

IX 

72,9 

333 

lorn. 

c 

37 

X 

33 

42 


23 

34 

100 

64, e 

41 


40 

-ic 

56.7 

66.3 







44 


34 

37 

u» 

72.7 



56 

34 

IX 

72.1 

12S2 

OH a tow 

X 

33 

A 

3 

32 

: 

29 

2 

95.7 

B6.a 

99 

1 

31 


IX 

86.7 







24 

3 

33 

3 

IK 



3 

33 

4 

3X 

ei.7 







■9 

1 

2£ 

2 

IX 

&t.£ 

99 

1 

2S 

1 

IX 

06-1 

1502 

Colorado 

c 

n 

y 

4 

H 

2 

6 

7 

IX 

32-9 

^B9 

2 

7 

6 

IX 

£2.8 








1 

12 

< 

IX 

37.6 


1 

12 

5 

IX 

£7.5 









9 

7 

IX 

£9.5 

72 



7 

TX 

39-9 

1572 

Kebrasfca 

c 

u 

p 

G« 


£B 

15 

58 

83.3 

70-5 


37 

■9 

55 

83.5 

72,4 








01 

12 

57 

96 .a 

63,6 


el 


5S 

96.0 









71 

14 

66 

S3.3 

63.6 

1^1 

72 

ttl 

£6 

30. S 



Symbol tfeflnltlOfis: 



Cr^ 2 _Od« 

A « 

C * tom 
£ • sor^hiB 
K * cscton 
P « p«ttur« 
t • soytMns 
r - sugar b«ts 



























































TABLE 4-2.- PROPORTION ESTIMATES ANOVA USING 60, 45, 
AND 30 STARTING DOTS 


Source of variation 

Degrees of 
freedom 

Sum of 
squares 

Mean 

square 

F-value 

(a) 

Using 60 starting dots 



Mean, segment, and 
crop effect 

13 

505.31 



Estimation procedure 
effect 

3 

32.59 

10.86 

0.85® 

Error 

248 

3149.10 

12.70 


Total 

264 

3687.00 



(b) 

Using 45 starting dots 



Mean, segment, and 
crop effect 

13 

697.02 



Estimation procedure 
effect 

3 

10.80 

3.60 

0.25® 

Error 

248 

3558.17 

14.34 


Total 

264 

4266.00 



(c) 

Using 30 starting dots 



Mean, segment, and 
crop effect 

13 

1223.56 



Estination procedure 
effect 

3 

1.83 

0.61 

0,03® 

Error 

248 

5564.61 

22.44 


Total 

264 

6790.00 




^Indicates nonsignificance at the a = 5-percent level. 




























TABLE 4-3.- STARTING DOTS ANOVA USING MACHINE CLUSTERING PROPORTIONS, 
SAE CLUSTERING PROPORTIONS, MACHINE CLASSIFICATION PROPORTIONS, 

AND SAE CLASSIFICATION PROPORTIONS 


Source of variation 

Degrees of 
freedom 

Sum of 
squares 

Mean 

square 

F-value 

(a) Machine clustering proportions 

Mean, segment, and 
crop effect 

13 

1918.96 



Starting dot effect 

2 

14.92 

7,46 

0.38® 

Error 

183 

3569.12 

19.50 


Total 

198 

5503.00 



(b) 

SAE clustering proportions 


Mean, segment, and 
crop effect 

13 

461.60 



Starting dot effect 

2 

3.67 

1.83 

0.22® 

Error 

183 

1550.73 

8.47 


Total 

198 

2016.00 



{c) Machine classification proportions 

Mean, segment, and 
crop effect 

13 

1968.49 



Starting dot effect 

2 

17.04 

8.52 

0.46® 

Error 

183 

3394,47 

18.55 


Total 

198 

5380.00 



(d) SAE classification proportions 

Mean, segment, and 
crop effect 

13 

487.19 



Starting dot effect 

2 

0.39 

0.20 

0.03® 

Error 

183 

1356.42 

7.41 


Total 

198 

1844.00 




®No significant difference was found at the a = 5-percent level. 










































table 4-4. The variances of the MEs, both clustering and classification, were 
significantly decreased by Increasing the number of starting dots from 30 to 
45 for both categories of Interest. These variances were again decreased, 
but not significantly, by Increasing the numbers of starting dots from 45 to 
60. The variances of the SAE estimates, both clustering and classification, 
did not significantly differ for any change in the number of starting dots. 

For 60 starting dots, no significant differences were found between any of 
the proportion estimation techniques for either crop. 

The variances of the PCCs on a per segment basis appear 1n the appendix, 
tables A-3a, A-3b, and A-3c for 60, 45, and 30 starting dots, respectively. 
These variances are pooled over all test segments for each number of starting 
dots and appear In table 4-5. The variance of the PCCl for clustering and 
classification was significantly decreased when the number of starting dots 
changed from 30 to 45 and from 45 to 60. The variance of the PCC2 for cluster- 
ing and classification was significantly decreased when the number of start- 
ing dots changed from 30 to 45. When the number of starting dots changed from 
45 to 60, the variance of PCC2 increased for both clustering and classifica- 
tion, but the Increase was not significant. For 60 starting dots, the variance 
of PCCl was significantly different from the variance of PCC2 for both clus- 
tering and classification. This significance can be attributed to the dif- 
ference between training and test data. 

The reduction coefficient, R, has been presented as a method of observing how 
much the machine classification reduces the variance of the SAE proportion 
estimation (ref. 3) in comparison with the variance of a simple random sample 
estimate. In the computation of the R-values, the omission and commission 
rates are computed by comparing the machine labels of the type 2 dots to the 
ground truth. The sampling error can be computed using the ground truth 
proportions or the labeling proportion from the type 2 dots denoted herein 
as 7T. The R-values were computed for both clustering and classification for 
all of the three-category runs. The omission and commission rates and the 
R-values are presented, for both ground truth and labeling proportions. In the 
appendix, tables A-4a, A-4b, and A-4c. The R-values were then averaged over 
the 33 runs for a particular number of starting dots (60, 45, or 30) and using 



TABLE 4-4,- VARIANCES^ OF THE PROPORTION ESTIMATES 


Number of 

Crop 

Cluster 

Classify 

starting dots 

Machine 

Bias 

Machine 

Bias 


w 

23.389 

5.562 

22.437 

6.549 

30 

H 

14.021 

6.312 

12.875 

4.597 


■ 

11.285 

6.181 

10.625 

6.111 

45 

H 

6.479 

7.431 

6.181 

5.076 



7.715 

8.194 

8.007 

6.542 

60 

HI 

3.493 

5.125 

3.083 

5.264 


^Statistical significance at the 5-percent level Is found 
whenever the ratio of variances Is at least two. 


TABLE 4-5.- VARIANCES OF THE PCC 


Number of 
starting dots 

Cluster 

Classify 

PCCl 

PCC2 

PCCl 

PCC2 

30 

16.751 

10.992 

12.782 

14.446 

45 

3.053 

4.590 

5.254 

5.484 

60 

1.138 

5.725 

0.954 

7.408 
























both the ground-truth and the labeling proportion. The average R-va1ues and 
their corresponding variances are presented In table 4-6. The averaged 
R-values were only slightly higher than those for the two-category PI runs 
(0.718 and 0.714) as documented in reference 3. 




TABLE 4-6.- THE R-VALUES AVERAGED OVER 33 THREE-CATEGORY RUNS 


Number of 
starting dots 

IT 

Cl uster 

Classify 

R-mean 

R-variance 

R-mean 

R-variance 

60 

Ground truth 

0,738 

0.024 

0.731 

0.034 

60 

Labeling proportion 

.739 

.025 

.729 

.032 

45 

Ground truth 

.736 

.032 

.730 

.029 

45 

Labeling proportion 

.735 

.032 

.729 

.030 

30 

Ground truth 

.775 

.029 

.746 

.034 

30 

Labeling proportion 

.774 

.029 

.746 

.034 















5. CONCLUSIONS AND REC0M1ENDATI0N5 


Upon examination of both ME and SAE proportion estimates produced by cluster- 
ing and classification* no significant differences were found between the 
proportion estimates and ground-truth estimates. Since this was the case In 
previous two-category studies (ref. 4). It is not considered unusual in the 
three-category case, but Instead, Indicates that conclusions should be made 
on the basis of the consistency or variance of the estimates as well as the 
accuracy. 

When testing the variances of the ME proportion estimates, a significant 
reduction In the variances was found when the number of starting dots was 
Increased from 30 to 45. The variances were again reduced, although not 
significantly, when the number of starting dots was Increased from 45 to 60. 
From these results, 60 starting dots are reconmended for a three-category 
classifier. 

When examining the variances of the estimates for the four estimation proce- 
dures (using 60 starting dots), no significant differences were found between 
procedures. Thus, only the machine clustering may be used to produce an 
estimate and the SAE computations and maximum-likelihood classification can 
be deleted. This will allow two advantages over PI: (1) using only cluster- 

ing will eliminate the additional machine time required by classification, 
and (2) deleting the SAE will minimize the analyst-labeling-time required 
because only type 1 dot labeling will be necessary. 

The variance of the PCCl was significantly lower for 60 starting dots than 
for either the set of 45 or 30 dots. Since the type 1 dots are the training 
data, an Increase in the training sample size Is expected to produce signif- 
icant decreases in the variance of the PCCl. For the PCC2, a significant 
reduction in the variance was observed when the number of starting dots was 
increased from 30 to 45. No significant differences were observed when the 
number of starting dots was increased from 45 to 60. Thus, the variance of 



PCC1 decreased when the number of starting dots Increased up to 60# and the 
variance of PCC2 decreased when the umber of starting dots Increased to 45 
and then statistically stabilized. This further' reinforces the choice of 60 
starting dots. 

The efficiency of PI In reducing the variance of a proportion estimate 
obtained from SAE has been presented In reference 3. In this experiment# 
virtually no difference existed between the R-values# regardless of the 
number of starting dots used or the proportion estimation procedure. There 
were no cases where the R-value was lower for clustering than for class1f1ca< 
tion. This would Indicate that classification was better than clustering* 
but the differences between the R-values were consistently very small. As 
In the twc-category case# these R- values Indicate that not much Is being 
gained by classification or clustering over a simple random sample. Since 
these R-values are the product of the best possible labeling of dots (ground 
truth}# an Improved procedure to PI seems desirable to Improve the cost- 
effectiveness of this machine processing. 

To sunmarize, the recommendations resulting from this study are as follows: 

a. A set of 50 starting dots should be used in a three-category classifier. 

b. The ME produced by nearest-neighbor clustering Is an adequate estimator. 

c. More study is needed in the area of an alternative for PI. 
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APPENDIX 


Tables A-1 through A-5* Included In the appendix, are supplemental material 
referred to In section 4 of this document. 




TABLE A-1.- DIFFERENCES BETWEEN PROPORTIONS ANO GROUND TRUTH 
USING 60, 45, ANO 30 STARTING DOTS 


(a) Using 60 starting dots 




Crop 1 

Crop Z 

Cluster 

Classify 

< 

iegimnt 

HacMne estlmste 

stratified areal 
estimate 

Machine estimate 

Stratified areal 
estimate 

Number 

Location 

Crop T 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

1076 

Nebraska 

c 

mm 

"13 

-4 

-1 

6 

-13 

.5 

-1 

6 





-3 

0 

7 

3 

-4 

0 

7 

3 




H 

-3 

-2 

5 

■■ 

-3 

-2 

4 

4 

1341 

Kansas 

c 

n 

e 

1 

-5 


mm 

1 

"5 

4 





1 


-1 



3 

-3 

2 





2 


■■ 


HI 

6 

-3 

3 

)591 

Nebraska 


E 

0 




1 

-3 

1 

"4 





-5 




-5 

.4 


-2 





-1 

■1 

0 

-1 

"1 

"2 


0 

146 

Kentucky 


X 



3 

0 


-2 







WM 

0 

-1 

-2 





185 

Ninnesota 


X 


-2 

"1 

-2 


.2 

"1 







"6 

-2 

2 

-6 

-6 

-2 






mm 

"4 

0 

-2 

mm 

■■ 

0 

^bh 

804 

Iowa 


X 


-3 

"6 

5 


^Bl 

"S 

HI 






"2 

-2 



•2 

*4 






"3 

•2 

"5 

-2 

Hfl 

-2 

-5 

"2 

812 

Mississippi 

X 

K 

1 

"2 

2 

-2 


"2 

2 

-1 





2 

-4 


-1 

1 


5 

-1 





3 

"2 


■9 


H 

-6 

■9 

824 

Illinois 

c 

X 

3 





H9 

■9 






0 

2 

*6 

3 


2 


mm 

8B3 

Iowa 

c 

X 

1 

-3 

3 

1 

1 

-2 


HI 





3 

-2 

"1 



-2 

1 


1253 

Oklahoma 

X 

A 

-1 

-2 

"4 

-1 

-1 

-2 

-2 

.2 






-2 

0 

HI 


-2 

0 

■h 





-1 

-2 

-7 

HI 

-1 

"2 

-6 

HI 

1502 

Colorado 

c 

Y 

-2 

-3 

"5 


-2 

•3 

-5 






-8 

"3 

1 

H9 

-8 

-3 

1 






"1 

1 

-3 

3 

■9 

1 

-2 

HI 

1572 

Nebraska 

c 

? 

"8 

12 

■9 

-5 


12 

■9 







11 


"10 

H9 

11 

mm 

-10 





-7 

5 


-1 

9oi 

6 

0 

1 


Symbol definittons; 
Crop codes 
A « alfalfA 


C ■ corn 
t ■ sorghun 
K ■ cotton 
P > pasture 
X ■ soybeans 
V * sugar beets 




































































TABLE A“l.~ Continued. 


(b) Using 45 starting dots 


Segment 

Crop 1 

Crop 2 

Cluster 

Classify 

Machin* estliute 

stratified areal 
estimate 

Machine estimate 

Stratified area) 
estimate 

Number 

Location 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

1075 

Nebraska 

C 

mm 

-IZ 



6 

•12 

■■ 

-2 

6 





-4 



4 

-S 


5 

4 





-3 



2 

-3 


4 

2 

1341 

Kansas 

C 

n 

7 



3 


^B 

mm 

3 





-3 

7 


1 


mm 


2 





4 

■■ 


1 


8 

BI 

2 

1591 

Nebraska 

C 

E 

2 




2 



•4 





-5 




-6 



-2 





-1 

1 

1 


-1 

mm 


-1 

146 

Kentucky 

c 

X 

-S 

4 

1 

1 

-6 

BI 

^^1 

-1 






-1 

1 

-2 

-1 

BH 


-3 

185 

Minnesota 

c 

X 

-4 

-1 

-3 

-2 

mm 

-1 

-3 

-2 





-5 

-6 

-3 

3 


-6 

-3 

3 





-7 

0 

0 

-1 

bI 



-1 

804 

Iowa 

c 

X 

-2 

-3 

-6 

5 


-3 

.6 






-1 

*2 

-4 

1 


-2 







-1 

-1 

-5 

•1 

-2 

-1 


9B 

812 

Mississippi 

X 

K 



2 

-2 

B9 

1 

mU 

-1 





0 

•4 

1 

-3 




-2 





2 

•1 

-5 


Bfl 

BI 

mm 


824 

Illinois 

c 

X 

2 

1 

2 

-7 



1 

-6 





■■ 

2 

-5 

3 

-1 


-3 

2 

803 : 

Iowa 

1 c 

X 


-2 

1 



^H| 

5 

-1 





1 

-3 

1 

0 


mm 

■■ 

mm 

1253 

Oklahoma 

X 

! A 


-2 

-4 

-1 










-2 




-2 

■n 







-2 

-5 

-1 

bI 


-5 

-2 

1502 

Co1 orado 

c 

V 


-2 

-5 

3 



-5 

2 





-B 

-3 


1 

-6 


1 

1 






3 


Bl 

-1 


-2 


1572 

Nebraska 

c 

P 

-8 

17 



-6 

17 







0 

11 

HS 

-10 


11 

-2 

-8 





-9 

8 

B9 

1 


6 

il^B!^9 

0 


Synibol definitions: 
Crop codes 
A - alfalfa 
C corn 
E ■ sorghum 
K ” cotton 
P ■ pasture 
X * soybeans 
Y « sugar beets 
































































TABLE A-1.- Concluded. 


(c) Using 30 starting dots 






Clutter 

Clasalfy 

Segment 

Crop 1 

Crop 2 

Machine estimate 

stratified areal 
ettfnate 

Machine eitlnate 

stratified areal 
estimate 

Number 

Location 



Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 



Crop 1 

Crop 2 

1075 

Nebraska 

C 

■a 

-12 

•6 

0 

S 

-1Z 

mgm 

0 






-13 

2 

4 

2 

-3 

■9 

4 





H 

-1 

-2 

3 

■1 

-1 


2 


1341 

Kansas 

n 


13 

4 

-B 

HI 

12 


-3 

3 1 





-9 

15 

3 


-9 








6 

17 

-z 

HI 

6 

HI 


2 

1591 

Nebraska 


E 

0 

-4 


19 

0 

-5 


-4 





-5 

-6 

2 


-5 

-6 

1 

-2 





-5 

-2 

1 

99 

■■ 

-3 

1 

0 

146 

Kentucky 

c 

X 

-3 

7 

3 

1 


8 


-1 




1 

-2 

0 

■■ 

0 

-2 


-4 

IdS 

Hinnesota 

c 

X 

-8 

-3 

-3 



-3 


-2 






•S 

-5 

1 

*6 

-5 

-s 

2 





MM 

-5 

■■ 


-3 

-5 


-2 

804 

iDMa 

c 

X 


-3 



-1 

•2 

-6 

6 






-2 


0 

2 

-2 

-3 

0 





HI 

-3 

HI 

-1 

1 

-2 

-5 

-1 

812 

Mississippi 

X 

K 


-2 

■9 

*1 

-6 

-2 

1 

-1 




-1 

-4 

0 

-2 

-1 

-3 


0 





■■ 


-5 

1 

6 



-1 

824 

Illinois 

c 

X 


-3 

-2 

-4 

5 

-2 







MM 

-2 

-3 

1 

5 

-2 


B^H 

883 

lows 

c 

X 


-1 

2 

1 

4 









■■ 

-3 

■I 

6 


-1 

^H 

1253 

Oklahoma 

X 

A 

-1 


-4 

^H 

0 


-2 






-9 

Mm 

0 


-9 



1 






-2 

-5 

HI 

99 

-2 

-5 

^9 

1502 

Colorado 

c 

Y 

-7 

-2 

-S 



-2 

MM 






-8 

-3 

1 

Kl 


-3 







■■ 

-1 

-2 

3 

1 

-1 

HI 

^H 

1572 

Nebraska 

c 

P 



1 

-6 

-7 

23 

1 




1 



17 



-5 

17 

-3 






-B 

7 

MM 


-8 

8 


99 


S^bol definitions: 
C rop codes 
A * alfalfa 


C • corn 
E ■ sorghum 
K « cotton 
P ■ pasture 
X o soybeans 
V • sugar beets 

































































TABLE A-2.- VARIANCES C: THE PROPORTION ESTIMATES USING 60. 45, AND 30 STARTING DOTS 

(a) Using 60 starting dots 


Segment 

number 

Cluster 

Classify 

Machine estimate 

Stratified areal 
estimate 

Machine estimate 

Stratified areal 
estimate 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

1075 

33.333 

4.000 

17.333 

2.333 

30.333 

6.333 

16.333 

2.333 

1341 

7.000 

6.333 

4.333 

0.333 

10.333 

6.333 

1.333 

1.000 

1591 

7.000 

4.000 

4.333 

1.000 

9.333 

1.000 

0.333 

4.000 

185 

2.333 

4.000 

1.000 

5.333 

2.333 

4,000 

1.000 

4.333 

804 

4.000 

0.333 

4.333 

13.000 

4.000 

0.333 

0.333 

9.333 

812 

1.000 

1.333 

28.000 

1.000 

1.000 

1.333 

32.333 

0.333 

1253 

0.333 

0.0 

12.333 

0.333 

0.333 

0.0 

9.333 

1.000 

1502 

14.333 

5.333 

9.333 

4.333 

14.333 

5.333 

9.000 

l.QOO 

1572 

19.000 

14.333 

0.333 

20.333 

21.333 

10.333 

0.0 

30.333 

146 

1.000 

1.000 

4.000 

1.000 

0.250 

1.000 

2.25l 

0.250 

824 

2.250 

1.000 

9.000 

12.250 

2.250 

1.000 

4.000 

9.000 

883 

1.000 

0.250 

4.000 

0.250 

0.250 

0.0 

2.250 

0.250 




TABLE A-2.— Continued 



(b) Using 45 starting dots 


Segnent 

number 

Cl uster 

Classify 

Machine estimate 

Stratified areal 
estimate 

Machine estimate 

Stratified areal 
estimate 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

1075 

24.333 

2.333 

8.333 

4.000 

23.333 

2.333 

14.333 

4.000 

1341 

26.333 

9.333 

4,333 

1.333 

26.333 

6.333 

3.000 

0.333 

1591 

1 

12.333 

12,333 

0.333 

4.000 

16.333 

9.000 

0.333 

2.333 

185 

2.333 

10.333 

3.000 

7.000 

2.333 

10.333 

3.000 

7.000 

804 

0.333 

1.000 

1.000 

9.333 

0.333 

1.000 

1.000 

10.333 

812 

1.333 

4.333 

14.333 

2.333 

0.333 

6.333 

25.333 

1.000 

1253 

17.333 

0.0 

7.000 

0.333 

17.333 

0.0 

7.000 

1.333 

1502 

1 

16,000 

10.333 

9.333 

2.333 

12.333 

10.333 

9.000 

1.000 

U72 

24.333 

21.000 

0.0 

31.000 

21.333 

21.000 

1.333 

16.333 

146 

6.250 

6.250 

0.0 

2.250 

4.000 

6.250 

1.000 

1.000 

824 

2.250 

0.250 

12.250 

25.000 

2.250 

0.250 

4.000 

16.000 

883 

2.250 

0.250 

2.250 

0.250 

2.250 

1.000 

4.000 

0.250 




TABLE A-2.- Concluded. 

(c) Using 30 starting dots 



Segment 

number 

Cluster 

Classify 

Machine estimate 

Stratified areal 
estimate 

Machine estimate 

stratified areal 
estimate 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

Crop 1 

Crop 2 

1075 

34.333 

16.000 

4.333 

6.333 

34.333 

20.333 

4.000 

7.000 

1341 

126.333 

49.000 

16.333 

4.333 

117.000 

46.333 

16.333 

2.333 

1591 

8.333 

4.000 

1.000 

4.000 

7.000 

2.333 

0.333 

4.000 

185 

6.333 

1.333 

6.333 

3.000 

6.333 

1.333 

6.333 

5.333 

804 

7.000 

0.333 

0.333 

14.333 

2.333 

0.0 

2.333 

10.333 

812 

36.333 

4.000 

9.333 

1.333 

37.333 

2.333 

26.333 

0.333 

1253 

24.333 

1.333 

7.000 

0.333 

27.000 

1.333 

6.333 

2.333 

1502 

24.333 

1,000 

9.000 

3.000 

21.333 

1.000 

6.333 

1.000 

1572 

2.333 

73.000 

2.333 

24.333 

2.333 

57.000 

4.000 

19.000 

146 

1.000 

16.000 

2.250 

6.250 

1.000 

20.250 

2.250 

1.000 

824 

9.000 

2.250 

2.250 

6.250 

12.250 

2.250 

0.0 

2.250 

883 

1.000 

0.0 

6.250 
1 

2.250 

1.000 

o 

• 

o 

4.000 

0.250 




TABLE A-3.™ VARIANCE OF THE PCC USING 
60, 45, AND 30 STARTING DOTS 

(a) Using 60 starting dots 












TABLE A-3.~ Continued. 

(b) Using 45 starting dots 


Segment 

number 

Cluster 

Classify 

PCCl 

PCC2 

PCCl 

PCC2 

1075 

0.0 

9.663 

0.0 

20.410 

1341 

1.613 

2.613 

1.613 

2.823 

1591 

11.590 

0.093 

41.070 

0.120 

185 

6.750 

9.390 

1.763 

3.773 

804 

0.0 

2.203 

1.613 

2.203 

812 

6.453 

8.423 

5.063 

2.203 

1253 

4.840 

6,070 

6.453 

9.120 

1502 

0.0 

5.590 

0.0 

6.413 

1572 

1.763 

2.170 

1.843 

4.423 

146 

1.210 

0.250 

1.210 

0.360 

824 

1.210 

4.203 

1.210 

3.063 

883 

1.210 

4.410 

1.210 

10.890 














TABLE A-3.- Concluded. 

(c) Using 30 starting dots 



Cluster 

Classify 

PCCl 

PCC2 

PCCl 

PCC2 

1075 

0.0 

13.890 

0.0 

34.443 

1341 

25.963 

22.863 

25.853 

24.943 

1591 

3.630 

2.403 

3.630 

5,643 

185 

3.743 

19.253 

27.543 

23.543 

804 

0.0 

5.813 

0.0 

3.803 

812 

14.963 

7.680 

3.853 

2,110 

1253 

3.630 

12.213 

0.0 

7.453 

1502 

0.0 

11.543 

0.0 

13.043 

1572 

49.083 

1.203 

45.563 

1 .703 

146 

100.000 

17.640; 

44.223 

43.560 

824 

0.0 

1.000 

0.0 

3.802 

883 

0.0 

16.402 

2.723 

9.303 















TABLE A-4.- THE R-VALUES FOR 60, 45, AND 30 STARTING 
DOTS USING LABELING PROPORTIONS 

(a) R-values for 60 starting dots 


Sagflient 

Crop 1 

Crop 2 

[Hun (u) 

Cluster 

■ 0. 738621 variance (o‘) ■ 0.025] 

[Hun (u) 

Cliiilfy , 

■ 0.726061 virlinei (o') * 0.032] 

Humber 

Location 



*10 

*01 

II 

R 

*10 

*01 

— 

IT 

R 

107E 

Kibrnlui 

c 

n 



0.429 

0.766 

■E9 

0,622 

0.429 

Bn 





.125 

.479 

.462 

.617 


.436 

.462 

■on 




H 


.444 

.429 

.707 

.050 

.409 

.423 

.645 

1341 

Kantii 

c 

19 

.469 


.633 

.666 

,466 

.250 

.538 

.910 





.408 


.633 

.926 

.429 

.286 

.633 

.917 





.460 


.624 

,961 

,447 

.346 

,539 

.466 

1691 

Nebrasita 

n 

£ 

.059 

.688 

.167 

.633 

.071 

.568 

.168 

.855 





.076 

.577 

,248 

,835 

,077 

,560 

.243 

,824 



H 


.111 

.435 

.221 

.787 

.062 

,500 

.214 

.750 

146 

Kentucky 


X 

.164 

.155 

.690 

.561 

.154 

.172 

.690 

.588 



H 


.344 

.123 

.640 

.697 

.313 

.140 

.640 

.591 

18S 

Hinnesota 

^9 

X 

.043 

.667 

.114 

.879 

.044 

.667 

.118 

.879 





.011 

.867 

.143 

.934 

,011 

.067 

,144 

.934 





.034 

.936 

,152 

.997 

.023 

,934 

.164 

,992 

804 

towa 

c 

X 

.269 

.069 

,752 

.606 

.269 

.114 

.752 

.637 





.233 

,107 

.714 

.573 

,267 


.712 

.685 





.194 

.054 

.706 

.411 

.194 


.072 

,413 

B1E 

Hlsilisippi 

X 

K 

.191 

.190 

.552 

,619 

.174 

.190 

.556 

.599 





.136 


.661 

.666 

.093 

,180 

,587 

.487 





.250 


.505 

.586 

.250 

.113 

.605 

,586 

6E4 

Illinois 

c 

X 

.778 

.014 

.686 

.880 

.778 

.014 

,888 

.880 





.500 

.011 

.698 

.617 

.500 

.011 

.897 

,616 

883 

iQwa 

■9 

X 

.308 

.243 

.729 

.626 

,269 

.267 

,729 

.613 





,300 

.196 

.651 

.753 

,241 

.196 

,659 

.700 

1253 

Oklahoma 

^9 

A 

.016 


.377 

,419 

.046 

.103 

.375 

.268 







.429 

.500 

.102 

.156 

.433 

.446 



H 


mm 

,176 

.324 

1 .622 

,104 

,147 

.337 

, .454 

1502 

Colorado 


V 

.055 

.714 

.133 

.921 

,044 

.667 

,117 

1 .878 





.000 

.737 

1 .181 

.778 

,000 

.737 

.165 

,775 




1 

.068 

.412 

,162 

1 .716 

.069 

.412 

.163 

.717 

1572 

Nebraska 

c 

1 p 

.636 

,091 

1 .733 

■ .823 

1 .571 

.078 

.733 

' .630 





.867 

.013 

.714 

I .936 

1 .867 

.013 

1 .714 



i 



.455 

,146 

.790 

, .860 

.455 

.169 

1 .790 

mm\ 


Symbol definitions: 

R « reduction cotfficlent 


7T » the probabITIty a pixel is labeled wheat 

itjq * the probability a pixel is classified wheat and labeled nonwheat 

WqI ■ the probability a pixel is classified nonwheat and labeled wheat 

Crop codes 

A ^ alfalfa 

C • corn 

E " sorghiiff 

K ” cotton 

P > pasture 

X » soybeans 

V * sugar beets 



























TABLE A-4.- Continued. 

(b) R-values for 45 starting dots 


Sigeient 

Crop 1 

Crop 2 

[Hiin (m) ■ 

Clutter * 

0.73476t virlence * 0.032] 

[N*in (w) 

Clmlfy , 

0,72916t v«runi. (a*) ■ 0,030] 

NiMbir 

Location 



*10 

*01 

n 

R 

*10 

*01 

e 

R 

I07S 

Htbraske 

C 

BQII 

o.oso 

0.578 

0.429 

0.79S 

0.068 

0.591 

0.427 

0,831 





.125 

.438 

.462 

.764 

.071 

,426 

.456 

.701 




■ 

.100 

.378 

.429 

,695 

.067 

.306 

.423 

.651 

1341 

Kansas 

c 

■■ 

.551 

.232 

.633 

.947 

.469 

*218 

,539 

.907 





.367 

.339 

.633 

.914 

.347 

.339 

.533 

,902 





.560 

.346 

.524 

,991 

.417 

.309 

.634 

.924 

1591 

Ktbraska 

c 

E 

.106 

.688 

.167 

,902 

,094 

.647 

.167 

*921 





.063 

.654 

.248 

.871 

,038 

.664 

.248 

.825 





.099 

.622 

,221 

,834 

,075 

.500 

,216 

,780 

146 

Kentucky 

c 

n 

.192 

,121 

.690 

.546 

.231 

,105 

,687 

.559 




H 

.219 

.140 

.640 

.594 

.260 

.105 

.640 

,671 

185 

HInnasoU 

c 

■■ 

.066 

.583 

.114 

.667 

,067 

.683 

■nn 

.867 





.011 

.800 

.143 

.680 

.on 

.800 

WSnm 

,880 





.079 

.875 

.152 

.997 

.057 

.875 

.155 

.990 

8M 

Iowa 

c 

X 

.231 

.127 

.752 

.624 

.269 

.128 

,750 

,661 





.200 

.093 

.714 

.509 

.233 

,055 

.712 

,549 





.229 

.054 

.706 

.452 

,226 

.055 

,702 

,450 

812 

Hlsslsslppl 

X 

mm 

,213 

.172 

.552 

.622 

.196 

.138 

.558 

,554 





.182 

.148 

,581 

,554 

.140 

.230 

.587 

*615 




H 

.250 

.132 

.505 

.612 

.235 

.132 

.510 

.594 

824 

minols 

c 

■■ 

.770 

.014 


.660 

,778 

.014 

,888 

.880 





.400 

.011 

*898 

.519 

.300 

.023 

,896 

,496 

BB3 

Iowa 

c 

X 

.192 

*263 

.025 

.741 

,269 

.232 

.726 

,767 





.020 

.214 

.661 

.455 

.172 

,195 

.659 

.629 

1253 

Oklahoma 

X 

mm 

.136 

.100 

.377 

.439 

.109 

,100 

.385 

.389 





.067 

.218 

524 

,483 

.069 

.244 

.437 

.601 




H 

*127 

*176 

.324 

.534 

.132 

.121 

,327 

.477 

1502 

Colorado 

c 

mM 

,077 

.786 

*133 

,975 

.079 

.846 

.127 

.992 





,000 

.684 

,181 

.725 

.000 

.684 

.184 

,726 




H 

.000 

.412 

.162 

,742 

.080 

,412 

.163 

.742 

1572 

Nebraska 

c 


*714 

.065 

.733 

,912 

,579 

.065 

,733 

,889 





.833 

.013 

,714 

.910 

.628 

.014 

,718 

.908 





.645 

,181 

.790 

.931 

.545 

.157 

.790 

.914 


Symbol diflnltlons: 

H ■ reduction coefficient 


IT - the probebi lUy a pixet Is labeled tPfheat 

■ the probability a pixel U classified wheat and labeled nonwheat 
Hq] the probability a pixel Is classified norwheat and labeled wheat 
Crop codes 
A * alfalfa 
C ■ corn 
E ■ sorghum 
K * cotton 
P - pasture 
X soybeans 
Y ■ sugar beets 


























TABLE A-4.- Concluded. 

(c) R-values for 30 starting dots 


Segment 

Crop 1 

— 

Crop 2 

[Hnn iu) 

Clutttr . 

■ 0.77446t virlinc. (o*) ■ 0.029} 

[Kmr (m) 

C1,i$1fy , 

■ 0.74633t VirliiK. (o‘) ■ 0.034} 

Number 

Locitlon 




mm 


R 

*10 

"01 


8 

107S 

Ntbrttki 

C 

■H 


0,669 


0,077 

0.050 

0.682 

0.423 

0,872 





.196 

,333 

■Ki^m 

.773 

,126 

.340 

,466 

.696 




H 

.IB? 

.366 

.429 

.779 

,133 

.302 

.417 

.067 

134t 

Kirui 

c 

■■ 

.510 

.161 ; 

.467 

.879 

,511 

.143 

,644 

,859 






.339 

.633 

.676 

.245 

.321 

.633 

.612 





.660 

.273 

,524 

,997 

,688 

,236 

.534 

,993 

1691 

Ntbntkd 

c 

E 

.116 

.706 

.167 

.966 

.096 

.547 

.170 

.922 






.654 

,248 

.889 

.064 

.640 

.243 

.663 





.086 

.739 

.221 

.952 

.000 

.696 

.223 

,931 

146 

Kentucky 

n 

■■ 

.385 

.156 

.690 

,783 

.462 

.158 

.687 

.845 





.313 

.105 

.640 

.638 

.219 

.105 

.640 

.536 

185 

Minnesota 


■■ 

.075 

.633 

.114 

.989 

.056 

.833 

.119 

.900 





.044 

.800 

.143 

.952 

,035 

,800 

,150 

.938 



H 

H 

.101 

1.000 

.152 

.983 

.112 

.938 

.162 

.997 

804 

lOM 



,269 

.177 

,752 

,734 


,143 

.748 

.719 





.300 

.067 

.714 

.661 

.276 

.068 

.716 

.639 



H 

H 

.258 

•041 

.705 

.462 

.233 

.027 

,709 

,366 

812 

Hissiflippi 

IH 


.170 

.269 

.552 

.677 

.109 

.241 

.666 

.532 





.205 

.116 

,561 

.531 

.196 

,197 

.698 

.640 



H 


.308 

.075 

.60S 

.596 

.294 

.038 

,610 

.619 

824 

Illinois 


X 

.889 

.014 

.886 

.961 

,889 

.014 

,388 

.961 





.500 

.046 

.696 

.772 

.400 

.047 

,895 

.694 

B83 

lows 

c 

■■ 

.346 

,157 

.729 

,764 

.280 

,174 

,734 

,732 




IH 

.267 

.066 

,659 

.558 

.241 

.070 

,663 

,497 

1253 

Oklahoma 



.121 

.126 

,377 

.446 

.106 

,160 

.381 

.455 





,100 

.269 

.429 

.604 


.289 

,433 

.680 




H 

.127 

.176 

.324 

.534 

.132 

.121 

.327 

,477 

1502 

Colorado 


WM 

.066 

.667 

.133 

.990 

,069 

.917 

,121 

.9996 





.000 

•60« 

.181 

,725 


,684 

1 UM 

.726 




H 

.046 

.353 

.162 

.597 

.048 

,313 

mm 

,569 

1572 

Nebraska 


n 

.821 

.000 

.733 

.862 

.760 

,000 

,733 

,804 





. 8 fJ? 

.027 

,714 

.958 

.862 

.027 

,718 

,955 




HBH 

,455 

.205 

.790 

.904 

.455 

.181 

.790 

.865 


Symbol dtfInUtons; 


R ■ reduction coefficient 
n * the probebfllty a pixel Is labeled Kheat 

■ the probability a pixel Is classified wheat and labeled nonwheat 
tT 0 ^ ■ the probability a pixel Is classified nonwheat and labeled wheat 

Crop codes 
A ■ alfalfa 
C ■ corn 
E « sorghum 
K 1 * cotton 
P • pasture 
X • soybeans 
V • sugar beets 
































TABLE A-5.-THE R-VALUES FOR 60, 45, AND 30 STARTING 
DOTS USING GROUND-TRUTH PROPORTIONS 

(a) R-values for 60 starting dots 


— ' "I 

Segment 

Crop 1 

Crop 2 

[Man (u) • 

Cluster 

0.73779s variance (a‘ 

') • 0.024] 

[Man (tt) • 

Clisilfy 9 

0, 731361 variance (o^) » 0,034] 

Number 

location 



*10 

*01 

If 

R 

*10 

*01 

TT 

ft 

107S 

Nebraska 

C 

mm 

0^033 

mmm 

0.37 

0,776 

0,067 

0.622 

0.37 

0.049 





-’25 

.479 


,815 

.107 

.436 


.758 




■ 

.067 

,444 


.701 

.050 

.409 


.636 

1341 

Kansas 

C 


.4S$ 

.023 

.64 

.664 

,468 

,250 

.54 

.910 





.408 

.321 


,926 

.420 

.266 


.917 





.460 

.346 


,962 

.447 

.345 


,956 

1S11 

Nebraska 

C 

£ 

.069 

.568 

.22 

.819 

.071 

.588 

.22 

,640 





.076 

.577 


.840 

,077 

.560 


.029 





.111 

.435 


.787 

.062 

.600 


,763 

146 

Kentucky 

c 

mm 

.164 

.165 

,63 

.640 

.154 

.172 

.63 

.666 




H 

.344 

.123 


.696 

.313 

.140 


.690 

’OS 

Hlnnesota 

c 


.043 

.667 

.16 

.861 

.044 

.667 

.16 

.063 





.011 

.867 


.932 

.011 

.667 


.932 




H 

.034 

.938 


.997 

.023 

.934 


.991 

Q04 

lOWf 

n 


.269 

.089 

,76 

.587 

.269 

.114 

.75 

.636 





.233 

.107 


.688 

.267 

.095 


,698 



H 

H 

,194 

.054 


.422 

.194 

.055 


.425 

812 

Klsslsslppl 



.191 

.190 

.55 

.619 

.174 

.190 

.55 

.599 





.136 

.197 


.560 

.093 

.160 


.477 



■ 


.250 

.113 


.581 

.260 

.113 


.581 

624 

Illinois 


% 

.778 

.014 

.93 

.698 

.778 

.014 

.93 

.997 



H 


,500 

.011 


.640 

,500 

.011 


.640 

8B3 

Iowa 


wm 

.308 

.243 

.70 

.820 

.269 

.257 

.70 

.803 



H 

H 

.300 

,196 


.765 

.241 

.196 


.713 

12S3 

Oklahoma 



.106 

.125 

.36 

.423 

.046 

.103 

.36 

.269 





,117 

,178 


.509 

.102 

.156 


.455 




H 

.183 

.176 


.610 

.104 

.147 


.448 

1502 

Colorado 

c 


.055 

.714 

.15 

.917 

.044 

.667 

.15 

.866 








.767 


,737 


,767 




H 

.068 



.723 

,069 

.412 


,726 

1572 

Nebraska 

n 


.636 

.091 

.7.1 


.571 

.078 

.78 

,839 





.867 

.013 



.667 

.013 


.935 



HH 

nil 

.465 

.146 


.840 

,455 

.169 


.071 


Symbol deffnitlons; 

R ■ reduction coefficient 


7 " the probability a pixel Is labeled Mtieat 

itqi ■ the probability a pixel Is classified wheat and labeled nonwfieat 
> the probability a pixel 1$ classified nonwheat and labeled i4ieat 

Crop codes 
A alfalfa 
C • corn 
£ ■ sorghum 
K ^ cotton 
P ■ pasture 
X » soybeans 
Y • sugar beets 
































TABLE A-S.- Continued. 

(b) R-values for 45 starting dots 


^^gment 

Crop 1 

Crop 2 

[Ktio (n) ■ 

Cluit.r - 

Oi73579‘, v.rl.nGt (o‘) • 0.032] 

[H..n (ii) ■ 

Cl.itify . 

0.72982} virUnce (o'^J ■ 0.029] 

Nuater 

Location 



"lO 

"oi 

w 

A 

”to 

”oi 

TI 

R 

107$ 

Nebraska 

C 

nj 

O.OM ^ 

0.678 

0.37 

0.788 

0.06B 

0.591 

0.37 

0.827 





.125 

.430 


.782 

.071 

*426 


.691 




H 

.100 

.376 


.693 

.067 

.386 


*646 

IMl 

Kansas 

C 


.551 

.232 

.54 

.947 

.489 

*210 

*54 

.907 





.367 

.339 


,9U 

.347 

.339 


,902 





.560 

.345 


.991 

,417 

.309 


,924 

1591 

Nebraska 

c 

E 

.106 

.588 

.22 

,080 

*094 

.647 

.22 

.910 





.063 

.654 


.875 

.038 

*654 


.828 





.099 

.522 


.835 

.075 

.500 


.779 

146 

Kentucky 

c 

X 

.192 

.121 

,63 

.531 

.231 

.105 

.63 

.549 





.219 

.140 


.592 

.250 

.105 


.570 

16S 

Hinnesota 

c 

X 

,06S 

.583 

.16 

.844 

.067 

.583 

,16 

.847 





.011 

.800 


.879 

,011 

,800 


.879 





.079 

.875 


*996 

.057 

.076 


.990 

604 

Iowa 

c 

X 

.231 

.127 

.76 

.623 

.269 

.128 

,75 

.661 





.200 

.093 


.524 

*233 

*095 


.563 





.229 

.054 


*461 

.226 

*055 


.460 

812 

Mississippi 

X 

K 

.213 

,172 

.55 

.622 

.196 

.138 

.55 

*554 





.102 

.140 


.551 

*140 

.230 


.607 





.250 

.132 


.609 

,235 

*132 


.592 

824 

11 llnols 

c 

X 

.776 

,014 

.93 

.aga 

*778 

.014 

*93 

.898 





.400 

.011 


*544 

.300 

*023 


.544 

683 

Iowa 

c 

X 

.192 

283 

.70 

.765 

*269 

,232 

.70 

,778 





.020 

,214 


.501 

,172 

.196 


.646 

1253 

1 OklahoLia 

X 

A 

.136 

,100 

.36 

.444 

.109 

.100 

,36 

*396 


1 



.067 

.216 


.463 

*069 

.244 


,497 


1 



.127 

,176 


,524 

.132 

.121 


.465 

1502 

1 

Colorado 

c 

Y 

.077 

.785 

.1^ 

.973 

.079 

,84C 

.15 

,991 





.000 

.664 


.718 

.000 

*684 


,718 





.080 

*412 


,750 

.080 

.412 


.750 

1572 

' Nebraska 

mm 

P 

.714 

.065 

.78 

.917 

.679 

.065 

.78 

*695 





.833 

.013 


*909 

.820 

.014 


,906 



Hi 


.545 

.181 


.930 

*545 

.157 


.912 


Symbol doftnltlons: 

R « reduction coefficient 


If « the probabnUy a pixel Is labeled wheat 

if|Q • the probability a pixel Is classified wheat and labeled nonwheat 
* the probability a pixel Is classified nonwheat and labeled wheat 

Crop codes 
A ■ alfalfa 
C • corn 
t sorghum 
K - cotton 
P ^ pasture 
X soybeans 
Y * sugar beets 
























TABLE A-5.- Concluded. 

(c) R- values for 30 starting dots 


& 

epent 

Crop 1 

Crop 2 

[Htin Cii) ■ 

Cluittr . 

0, 77609; virlinct ■ 0.029] 

[Hem (m) • 

Classify 

0.74636t variance 

) • 0.034] 

Ntmbtr 

Location 



"10 

"oi 

I 

R 

"10 

"01 


R 

1075 

Nebraska 

n 

m 

0,050 

0.609 

0.37 

0.873 

0,050 

0.682 

0.37 

0.868 





J96 

,333 


.778 

.125 

.340 


.695 



H 

H 

.103 

,356 


.783 

.133 

.302 


.669 

1341 

Kansas 


n 


.161 

.54 

.875 

.511 

,143 

.54 

.859 






.339 


.075 

.245 

.321 


.813 



H 


,680 

.273 


.997 


,236 


.993 

1591 

Nebraska 


E 

.118 

.70S 

.22 

.960 

.096 

.647 

.22 

.912 





,076 



,093 

.064 

.640 


.866 



H 


.086 



.952 

.088 

.696 


,932 

M6 

Kentucky 

^9 

X 

.385 

.155 

.63 

,775 

.462 

.150 

.63 

,839 





.313 

,106 


.637 

.219 

.105 


,535 

185 

Hlnnesota 

c 

X 

.075 

.833 

.16 

.986 

.056 

,033 

.16 

.976 





.044 

.800 


.949 

.035 

.800 


.937 





.101 



.982 

.112 

.938 


.996 

004 

towa 

c 

X 

.269 

.177 

.75 

.734 

.300 

.143 

.75 

.720 





.300 

.067 


.569 

.276 

,060 


.547 





.258 

.041 


.456 

.233 

.027 


.305 

B12 

Hississippi 

X 

K 

.170 

.269 

.55 

.677 

.109 

.241 

.56 

,580 





.205 

.115 


.531 

.195 

.197 


.633 





.308 

.075 


.587 

.294 

.038 


.507 

B24 

minots 

c 

X 

.809 

.014 

.93 

.970 

.809 

.014 

,93 

,970 





.500 

.045 


.810 

.400 

,047 


.746 

B03 

Iowa 

c 

X 

.346 

.157 

.70 


.280 

.174 

,70 

.720 





.267 

.086 



.241 

.070 


,602 

1253 

Oklahoma 

% 

A 

.121 

.125 

.36 

,451 

.108 

.150 

.36 

.459 





.100 

.289 


.605 

.085 

.289 


.578 





.127 

.176 


.524 

.132 

.121 


,466 

1502 

Colorado 

c 

Y 

.066 

.857 

.15 

.909 

.069 

,917 

.15 

.9996 






.684 


.718 

.000 

,864 


.718 





.046 

.353 


,605 

.048 

.313 


,577 

15^2 

Nebraska 

n 

P 

,821 

.000 

.78 

.855 

.750 

.000 

.78 

.794 





.067 

.027 


.960 

.662 

.027 


.957 



9^1 


.455 

.205 


.902 

,455 

.101 


.882 


Syniiol deftnUtons; 

R ■ reduction coefflcUnt 


■ the probability a pixel Is labeled idieat 
ri|Q ■ the probability a pixel Is classified wheat and labeled normheat 
the probability a pixel Is classified nonwhoat and labeled wheat 

Crop codes 
A - alfalfa 
C • com 
E > sorghum 
K ■ cotton 
P - pasture 
K ■ soybeans 
Y • sugar tieets 


































